Search results for "Lexical database"

showing 6 items of 6 documents

Sub-symbolic Encoding of Words

2003

A new methodology for sub-symbolic semantic encoding of words is presented. The methodology uses the WordNet lexical database and an ad hoc modified Sammon algorithm to associate a vector to each word in a semantic n-space. All words have been grouped according to the WordNet lexicographers’ files classification criteria: these groups have been called lexical sets. The word vector is composed by two parts: the first one, takes into account the belonging of the word to one of these lexical sets; the second one is related to the meaning of the word and it is responsible for distinguishing the word among the other ones of the same lexical set. The application of the proposed technique over all…

Computer sciencebusiness.industryLatent semantic analysisWordNetLexical databaseSemanticscomputer.software_genreLexical setLexical itemLexicographySyntactic categoryArtificial intelligencebusinesscomputerNatural languageWord (computer architecture)Natural language processing

researchProduct

Wordnet and semidiscrete decomposition for sub-symbolic representation of words

2009

A methodology for sub-symbolic semantic encoding of words is presented. The methodology uses the standard, semantically highly-structured WordNet lexical database and the SemiDiscrete matrix Decomposition to obtain a vector representation with low memory requirements in a semantic n-space. The application of the proposed algorithm over all the WordNet words would lead to a useful tool for the sub-symbolic processing of texts.

Information retrievalComputer sciencebusiness.industryWordNetDecomposition (computer science)Artificial intelligenceRepresentation (mathematics)computer.software_genrebusinessLexical databasecomputerNatural language processingMatrix decomposition

researchProduct

LEXOP: a lexical database providing orthography-phonology statistics for French monosyllabic words.

1999

During the last 20 years, psycholinguistic research has identified many variables that influence reading and spelling processes. We describe a new computerized lexical database, LEXOP, which provides quantitative descriptors about the relations between orthography and phonology for French monosyllabic words. Three main classes of variables are considered: consistency of print-to-sound and sound-to-print associations, frequency of orthography-phonology correspondences, and word neighborhood characteristics.

Lexical densitymedia_common.quotation_subjectStatistics as TopicDictionaries as TopicExperimental and Cognitive PsychologyLexical databasecomputer.software_genrePsycholinguisticsConsistency (database systems)Reading (process)General Psychologymedia_commonLanguagePsycholinguisticsbusiness.industryPhonologySciences bio-médicales et agricolesSpellingDatabases as TopicPsychology (miscellaneous)Artificial intelligenceFrancePsychologybusinesscomputerOrthographyNatural language processing

researchProduct

On the advantages of word-frequency and contextual diversity measures extracted from subtitles: the case of Portuguese

2015

Accepted manuscript. Epub ahead of print, 29 Sep. 2014.

MaleAdolescentPhysiologyComputer scienceDecision MakingMotion PerceptionSocial SciencesExperimental and Cognitive PsychologyLexical databaseVocabularySubtitlesYoung AdultPhysiology (medical)Reaction TimeHumansCorpus basedWord frequencyGeneral PsychologyScience & TechnologyPortugalPortugueseContextual diversityGeneral MedicineLinguisticslanguage.human_languageSemanticsWord lists by frequencyNeuropsychology and Physiological PsychologyReadinglanguageRegression AnalysisFemalePortuguesePhotic StimulationPsychomotor PerformanceContextual diversity

researchProduct

Miten viittomakielen korpusta luodaan ja mihin sitä tarvitaan? Viittomakielten korpukset ja niiden tehtävät

2020

Artikkeli käsittelee suomalaisen ja suomenruotsalaisen viittomakielen korpusten luontia CFINSL-projektissa (Corpus project of Finland’s sign languages, Suomen viittomakielten korpusprojekti). Viittomakielillä ei ole kirjoitettua muotoa, joten korpusten laatiminen vaatii erilaista lähestymistä kuin korpusten luonti sellaisille puhutuille kielille, joilla on kirjoitettu muoto. Artikkelissa kuvataan ne menetelmät, joilla Jyväskylän yliopiston viittomakielen keskuksessa on koottu aineistoa suomalaisen ja suomenruotsalaisen viittomakielen korpukseen. Lisäksi kuvataan korpusaineiston teknistä käsittelyä, annotointia, metatietojen keruuta ja käsittelyä sekä aineiston säilytystä ja tutkijoiden käyt…

SignbankComputer sciencebusiness.industrySign languagecomputer.software_genreLexical databaseMetadataAnnotationsuomenruotsalainen viittomakieliviittomakielien korpussuomalainen viittomakieliGeneral Earth and Planetary ScienceskorpuksetArtificial intelligenceleksikkotietokantabusinesscomputerNatural language processingannotaatioGeneral Environmental ScienceSign (mathematics)Puhe ja kieli

researchProduct

GreekLex 2: A comprehensive lexical database with part-of-speech, syllabic, phonological, and stress information

2017

Databases containing lexical properties on any given orthography are crucial for psycholinguistic research. In the last ten years, a number of lexical databases have been developed for Greek. However, these lack important part-of-speech information. Furthermore, the need for alternative procedures for calculating syllabic measurements and stress information, as well as combination of several metrics to investigate linguistic properties of the Greek language are highlighted. To address these issues, we present a new extensive lexical database of Modern Greek (GreekLex 2) with part-of-speech information for each word and accurate syllabification and orthographic information predictive of stre…

VocabularyDatabases FactualComputer scienceSocial Scienceslcsh:Medicinecomputer.software_genreLexical databaseVocabulary0302 clinical medicinePsychologylcsh:ScienceLanguagemedia_commonPsycholinguisticsMultidisciplinaryGreeceSyllabification05 social sciencesModern GreekSyllablesPhoneticsGreek languagePhysical SciencesSyllabic verseSyllableNatural language processingResearch ArticleStatistical Distributionsmedia_common.quotation_subjectDNA transcriptionGrammatical categoryPhonology050105 experimental psychology03 medical and health sciencesPhoneticsGeneticsHumansSpeech0501 psychology and cognitive sciencesVowelsbusiness.industrylcsh:RPhonetic transcriptionCognitive PsychologyBiology and Life SciencesLinguisticsProbability TheoryPart of speechCognitive Sciencelcsh:QGene expressionArtificial intelligencebusinesscomputerMathematics030217 neurology & neurosurgeryOrthographyNeurosciencePLOS ONE

researchProduct